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Abstract. The Library of Babel, described by Jorge Luis Borges, stores an enormous amount of informa- 
tion. The Library exists ab aeterno. Wikipedia, a free online encyclopaedia, becomes a modern analogue 
of such a Library. Information retrieval and ranking of Wikipedia articles become the challenge of modern 
society. While PageRank highlights very well known nodes with many ingoing links, CheiRank highlights 
very communicative nodes with many outgoing links. In this way the ranking becomes two-dimensional. 
Using CheiRank and PageRank we analyze the properties of two-dimensional ranking of all Wikipedia En- 
glish articles and show that it gives their reliable classification with rich and nontrivial features. Detailed 
studies are done for countries, universities, personalities, physicists, chess players, Dow- Jones companies 
and other categories. 



PACS. 89.75.Fb Structures and organization in complex systems 
trees - 89.20. Hh World Wide Web, Internet 



.75. He Networks and genealogical 



1 Introduction 



The Encyclopedie [T] accumulates the available human 
knowledge making it accessible to all citoyennes. In this 
way the Encyclopedie becomes one of the most powerful 
catalysts of modern development of science and society |2. • 
This process of knowledge transfer becomes enormously 
accelerated with the appearance of Wikipedia [3l, a free 
online encyclopaedia, which current size overcomes 6 mil- 
lions English entries OH] . Wikipedia comes close to Ency- 
clopaedia Britanica [B] in terms of the accuracy of its sci- 
ence entries [5] overcoming the later by far in an enormous 
amount of available information. The classification and 
ranking of this information becomes the great challenge. 
The statistical analysis of directed network generated by 
links between Wikipedia articles [7l[8j|9] established their 
scale-free properties showing certain similarities with the 
World Wide Web and other scale-free networks [lOyili 
fT2l[T3lfT4] . Here we apply a two-dimensional ranking algo- 
rithm, proposed recently [TS], and classify all Wikipedia 
articles in English by their degree of communication and 
popularity. This ranking allows to select articles in a new 
way, giving e.g. more preference to communicative and 
artistic sides of a personality compared to popularity and 
political aspects stressed by the PageRank algorithm [16_. 
With a good reliability the ranking of Wikipedia articles 
reproduces the well established classifications of countries 
[T7], universities [TH], personalities [12], physicists, chess 
players, Dow- Jones companies and other categories. 



The paper is composed as follows: in Section 2 we 
describe CheiRank and PageRank algorithms and apply 
them for two-dimensional ranking of Wikipedia articles, 
in Section 3 the results of such a ranking are discussed 
for various categories of articles, discussion of findings is 
given in Section 4. 



2 CheiRank versus PageRank 

At August 18, 2009 we downloaded from the latest 
English Wikipedia snapshot and by crawling determined 
links between all Ntot = 6855098 entries. We keep for our 
study only entries which are linked with other entries and 
eliminate categories from consideration. After that we ob- 
tain a directed network of = 3282257 articles. The mul- 
tiplicity of links from one article to another is taken into 
account. The distributions of ingoing and outgoing links 
are well described by a power law Win,out{k) oc l/fc/^i".""* 
with the exponents /ii„ — 2.09±0.04 and ^out — 2.76±0.06 
(see Fig. [1]) being in a good agreement with the previ- 
ous studies of Wikipedia [HIHlin] and the World Wide Web 

(WWW) [I1I1I1I22]- 

Due to the similarity with the WWW it is natural 
to construct the Google matrix of Wikipedia using the 
procedure described in [T6lll4) : 



aS^j -t- (1 - a)/N 



(1) 



where the matrix S is obtained by normalizing to unity all 
columns of the adjacency matrix, and replacing columns 
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Fig. 1. (Color online) Distribution Win,oiit(fc) of ingoing (a) 
and outgoing (b) links fc for iV = 3282257 Wikipedia English 
articles. The straight dashed fit line shows the slope with fj>in — 
2.09 ± 0.04 (a) and /lout = 2.76 ± 0.06 (b). 
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In K, In K* 

Fig. 2. (Color online) Dependence of probability of PageRank 
P (red curve) and CheiRank P* (blue curve) on the corre- 
sponding rank indexes K and K* . The straight dashed lines 
show the power law dependence with the slope /3 — 0.92; 0.57 
respectively, corresponding to /3 = l/inin.out — 1). 



with zero elements by N being the network size. 

The damping parameter a in the WWW context describes 
the probability to jump to any node for a random surfer. 
For Wikipedia this parameter can describe the probabil- 
ity to modify an article that affects the overall ranking. 
The value a = 0.85 gives a good classification [T3] for 
WWW and thus we also use this value here. The matrix 
G belongs to the class of Perron- Frobenius operators [13] , 
its largest eigenvalue is A = 1 and other eigenvalues have 
|A| < a. The right eigenvector at A = 1 gives the proba- 
bility P{i) to find a random surfer at site i and is called 
the PageRank. Once the PageRank is found, Wikipedia 
articles are sorted by decreasing P{i), the article rank in 
this index K{i) reflects the article relevance. The PageR- 
ank dependence on K is well described by a power law 
P{K) oc (see Fig. [2]) with /3 « 0.9 that is consistent 

with the relation (3 — l/(/ii„ — 1) corresponding to the 
proportionality of PageRank to its in-degree Wm [H] ■ 

In addition to the PageRank, following the approach 
to Linux Kernel procedure call network proposed recently 
by Chepelianskii [T^, we also consider the ranking of ar- 
ticles obtained from the conjugated Google matrix G* . 




Fig. 3. (Color online) Density distribution W{K,K*) — 
dNi/dKdK* of Wikipedia articles in the plane of PageRank 
and CheiRank indexes (In A', In if*) shown by color with blue 
for minimum and white for maximum (black for zero); (a) red 
points show top 100 countries from 2DRank, yellow pluses show 
top 100 countries from SJR country documentation ranking for 
1996-2008 J/Tj, cyan stars mark 30 Dow- Jones companies |20) : 
(b) red points show top 100 universities from 2DRank, yel- 
low pluses show top 100 universities from Shanghai ranking of 
2009 |18) : (c) green/red points show top 100 personalities from 
PageRank/CheiRank, yellow pluses show top 100 personalities 
from 19. ; (d) green and red points show ranks of 754 physicists, 
red points mark 193 Nobel laureates. 



This matrix is built from the adjacency matrix where all 
link directions are inverted to opposite. After this inver- 
sion the Google matrix G* is consructed in a usual way 
using a = 0.85. The eigenvector P*{i) of G* at A = 1, 
introduced in T5^, allows to chercher I 'information in a 
new way and we call it CheiRank. English spelling of 
this term sounds as Russian phrase which translation is 
"whose rank" . CheiRank gives additional ranking of arti- 
cles in decreasing order of P*{i) with rank index K*{i). 
Our data, shown in Fig. [21 give a power law dependence 
P* oc with /3 « 0.6 corresponding to the relation 

/3 — l/(/ioMt — 1) similar to the one found for the PageR- 
ank. Both ranks are normalized to unity. 

Now all articles (or nodes) can be ordered in decreas- 
ing monotonic order of probability of PageRank P{i) or 
in decreasing monotonic order of probability of CheiRank 
P*(i). In this way the ranking of nodes becomes two- 
dimensional so that each node i has both CheiRank K*{i) 
and PageRank K{i) (see Figs. I3I4|) . Such a ranking on 2D 
plane {K,K*) takes into account a combined effect of in- 
going (for PageRank) and outgoing (for CheiRank) links. 
The usual PageRank is obtained by projection of all nodes 
on horizontal axis of K and the ranking only by outgoing 
links via CheiRank is obtained by projecting all nodes on 
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In A' InK 
Fig. 4. (Color online) Left panel shows density distribution of 
Wikipedia articles W{K,K'') — dNi/dKdK* in the plane of 
PageRank and CheiRank indexes {\nK,\nK*) as in Fig. O 34 
green and red points show distribution of chess players, world 
champions are marked in red. Right panel shows density dis- 
tribution of = 3282257 articles obtained with independent 
probability distributions generated by P and P* given by the 
dependence from Fig. [S] Color choice is as in Fig. [H] 




Fig. 5. (Color online) Panel shows W{K,K*) density de- 
pendence from Fig. [3] on a parametric variable 77 which 
parametrizes the straight line \nK = xo+ri/2, \nK* = xo—'q/2 
with xq = 10 (blue curve)), 12 (red curve); dotted curves 
show densities from independent probabilities, of right panel 
of Fig. [4l taken at the same 77— lines. 



the vertical axis of K* . The data show that there are no 
articles which have small K and K* values simultaneously. 
We will discuss the rich properties of 2D ranking below in 
more detail. 

While PageRank characterizes a degree of knowledge 
and popularity of a given site z, CheiRank highlights its 
communication, influence and connectivity degree. These 
ranks have certain analogy to authorities and hubs appear- 
ing in the HITS algorithm 23 but the HITS is query de- 
pendent while the rank probabilities P{i) and P*{i) clas- 
sify all sites of the network. With these two qualitatively 
different characteristics the ranking of sites becomes two- 
dimensional (2D). 

The density distribution W{K,K*) = dNi/dKdK* of 
Wikipedia articles Ni in the plane {\x\K,\x\K*) is shown 
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Fig. 6. (Color online) Dependence of correlator k on a. Black 
circles, green squares and red triangles correspond to cases 
(a = a*), (fixed a = 0.85 and 0.5 < a* < 1) and (fixed 
a* — 0.85 and 0.5 < a < 1), respectively. In the main part of 
the paper we use only the case a = a* = 0.85. 



in Figs, [m (Y,, W{K,K*) = 1). Density distribution of 
Wikipedia articles W{K, K*) = dN./dKdK* is computed 
over equidistant grid in plane (In iiT, IniiT*) with 100 x 
100 cells, color shows average value of W in each cell, the 
normalization condition is W{K,K*) = 1. 

The densities obtained from the product of indepen- 
dent probabilities of P and P* generated by the distribu- 
tions of Fig. [5] give very different density W{K,K* ) with 
W being homogeneous along lines InK* = — InK + const. 
At large values of K,K* the distribution W{K,K*), ob- 
tained from independent probabilities, is constant, apart 
from small fluctuations, since the measure is given by 
dKdK* is homogeneous (see dotted curves in Fig. [5] and 
right panel in Fig. U]). However, at small values of K, K* 
the cells, which size decreases with K,K*, start to have 
small number of articles per cell (< 1) and fluctuations 
give nonhomogeneous behavior of W{K, K*). 

In contrast to the Linux network [TS], the Wikipedia 
network has a maximum of density along the line In K* « 
5 -I- (21ni4r)/3 that shows a strong correlation between 
P and P* (see Figs. I3l4p . Indeed, for Wikipedia we find 
that the correlator k between PageRank and CheiRank 
probabilities P{i) and P*{i) is rather high: 



N^P{z)P*{z) 



1 = 4.08 



(2) 



This value is much larger than for the Linux network 
where k w —0.05. Due to these correlations the distribu- 
tion W{K,K*) is absolutely different from the one given 
by independent product probabilities for PageRank index 
K and CheiRank index K* which is homogeneous along 
lines ln(i^*) -I- In(i^) = const (see Fig. |3] (right panel) 
and Fig[S]). Indeed, the real density of articles W{K,K*), 
taken along a given r;— line in the plane {\nK,\nK*), is 
log-normal and has a Gaussian form with a certain width 
as it is shown in Fig. [SJ 
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The value of correlator n for Wikipedia is comparable 
with the one of Cambridge University network (k — 3.79 
[TS]) but the probability distributions are different. De- 
pendencies of the correlator k, on damping parameters a 
and a* are shown in Fig. [S] (here a* is the damping pa- 
rameter for the Google matrix G* with inverted direction 
of links). For all realistic values of a and a* the correlator 
remains larger than unity showing that the correlation be- 
tween CheiRank and PageRank is a robust feature of the 
Wikipedia hyperlink network. 

From a physical view point a positive value of corre- 
lator K marks the tendency to have more outgoing and 
ingoing links between certain sets of nodes. This prop- 
erty is clearly visible for Cambridge University network, 
as discussed in [IF, and for Wikipedia network discussed 
here. On the other side small or negative values of k cor- 
respond to anti-correlation between ingoing and outgoing 
links, that looks like an effective repulsion between them. 
Such a situation appears in the Linux Kernel network |15j . 
Future more detailed studies of this correlator should be 
done for other types of networks to understand its prop- 
erties in a better way. 



3 Ranking of selected categories 

The difference between PageRank and CheiRank is clearly 
seen from the names of articles with highest rank (detailed 
data for all ranks and all categories considered are given 
in Appendix and [Mj). At the top of PageRank we have 
1. United States, 2. United Kingdom, 3. France while for 
CheiRank we find 1. PortahContents/ Outline of knowl- 
edge/Geography and places, 2. List of state leaders by year, 
3. PortabGontents/ Index/ Geography and places. Clearly 
PageRank selects first articles on a broadly known sub- 
ject with a large number of ingoing links while CheiRank 
selects first highly communicative articles with many out- 
going links. Since the articles are distributed in 2D they 
can be ranked in various ways corresponding to projec- 
tion of 2D set on a line. The horizontal and vertical lines 
correspond to PageRank and CheiRank. 

We also introduce 2DRank K2{i) to characterize mixed 
effect of CheiRank and PageRank. The list of K2{i) is con- 
structed by increasing K ^ K +1 and increasing 2DRank 
index i^2(«) by one if a new entry is present in the list of 
first K* < K entries of CheiRank, then the one unit step 
is done in K* and K2 is increased by one if the new entry 
is present in the list of first K < K* entries of CheiRank. 
More formally, 2DRank K2{i) gives the ordering of the 
sequence of sites, that appear inside the squares 
[1,1; K — k, K* = k; ...] when one runs progressively from 
fc = 1 to iV. In fact, at each step k ^ k + 1 there are tree 
possibilities: (i) no new sites on two edges of square, (ii) 
only one site is on these two edges and it is added in the 
hsting of K2{i) and (ni) two sites are on the edges and 
both are added in the listing K2{i), first with K > K* 
and second with K < K* . For (iii) the choice of order of 
addition in the list K2{i) affects only some pairs of neigh- 
boring sites and does not change the main structure of 
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Fig. 7. (Color online) A toy example illustrating the func- 
tioning of 2DRank algorithm of node ranking in K, K* plane: 
square size fc x fc is regularly increase in size fc — >■ fc + 1, nodes 
appearing on edges of this square at each step are included in 
the listing K2 of 2DRank (crosses), first on right edge, then on 
top edge; nodes outside of the square (circles) are included in 
the listing K2 at later stage. Numbers near symbols give K2 
values of 2DRank. 



ordering. This 2DRank algorithm is illustrated in Fig. [7] 
by a toy example. 

Wikipedia articles with highest 2DRank are 1. India, 
2. Singapore, 3. Pakistan. Thus, these articles are most 
known/popular and most communicative at the same time. 
We note that the ranking of Wikipedia articles by PageR- 
ank and HITS authorities has been discussed in a liter- 
ature (see e.g. [251126] ) but 2D analysis was never done 
before. 

To understand the properties of three ranks K, K2, K* 
in a better way we consider the three main categories 
of articles about countries, universities and personalities. 
The locations of 100 top internationally established ranks, 
taken according to 17,18,19j respectively, are shown in 
the Wikipedia rank plane (In if. In if*) in Fig. |3^,b,c. In 
average the points are distributed along the band of max- 
imal density of W in this plane. The same is valid for 
Dow- Jones companies (Fig. [3^) and physicists (Fig. |3ji) 
taken from the Wikipedia List of physicists (with few ad- 
ditions). We also show 100 top countries (Fig. [3^), 100 
top universities (Fig. [5}d) according to 2DRank and 100 
top personalities according to PageRank and CheiRank 
(Fig. [3]:). Distribution of chess players is shown in left 
panel of Fig. H) 

According to PageRank and 2DRank the chosen cat- 
egories are ordered as following: countries (category top 
rank values if = 1, if 2 = 1, K* — 107), universities 
[K = 180, K2 = 5, if* = 836), personalities (if = 268, 
if 2 ~ 233, if* — 33), Dow- Jones companies (if = 366, 
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Fig. 8. (Color online) Dependence of overlap fraction / of 
top Wikipedia articles for various categories on selected rank 
Ks taken from different internationally established sources: 
(a) SJR country documentation rank for 1996-2008 [IT!; (b) 
Shanghai university rank of 2009 [18]; (c) Hart's personality 
rank [T5]. Colors mark Wikipedia PageRank K (blue), 2DRank 
K2 (green) and CheiRank K* (red). In (a) the order of top 100 
countries is the same for 2DRank and CheiRank; in (b) the 
gray dotted curve shows also / for the rank given by advanced 
Google search of a word university in the Wikipedia English 
domain at May 19, 2010. 



K2 = 1082, K* = 14298), physicists {K = 505, K2 = 
1202, K* = 7622) and chess players {K = 23903, K2 = 
5354, K* = 35670). Such an ordering of these categories 
corresponds to a natural influence proportional to an effec- 
tive size of categories (e.g. countries are larger than univer- 
sities, which are larger than personalities etc.). Pleasantly 
enough, from the point of Wikipedia ranking, universities 
are much more influential than Dow- Jones companies. At 
the same time it is interesting that for chosen categories 
the communicative power measured by CheiRank is most 
high for personalities. 

We discuss the main features of three ranks if, K2,K* 
starting from countries. The first three countries are 1. 
United States, 2. United Kingdom, 3. France for PageR- 
ank and 1. India, 2. Singapore, 3. Pakistan for 2DRank and 
CheiRank. To determine in a quantitative way how accu- 
rate is this ranking we introduce the overlap fraction / de- 
fined as the relative number of same entries inside the first 
Kg entries of rank of selected countries, as stated in pTTl, 
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Fig. 9. (Color online) Dependence of window overlap fraction 
/u, for top Wikipedia articles on internationally established 
ranks from Fig. [8] respectively for panels (a), (b), (c) (points 
are placed at the middle of window interval, window size is 20 
nodes). 



and first Ks countries according to K,K2,K* ranking. 
The dependence f{Ks) is shown in Fig. We see that 
the Wikipedia PageRank reproduces in average about 80% 
of top countries selected by p2]- CheiRank and 2DRank 
do not reproduce the initial values of rank [T7] but reach 
around 70% at maximum Kg = 100. Indeed, CheiRank 
and 2DRank place at the top countries which are not so 
authoritative as United States but which are strongly con- 
nective due to historical (e.g. Egypt is at position 13) or 
tourist reasons (e.g. Thailand at , Malaysia at 7). Defi- 
nitely the first triplet of PageRank gives western coun- 
tries which are better known and historically more pow- 
erful. However, the first triplet of CheiRank and 2DRank 
shows that the communicative power of other countries, 
even such small as Singapure, dominates on the pages of 
Wikipedia. This may be a sign of future changes. Let us 
also note that People's Republic of China is only at 26 
position in 2DRank that can be attributed to non-English 
origin of the country and political separation of historical 
China (its SJR rank is 4). 

According to Wikipedia the top universities are 1. Har- 
vard University, 2. University of Oxford, 3. University of 
Cambridge in PageRank; 1. Columbia University, 2. Uni- 
versity of Florida, 3. Florida State University in 2DRank 
and CheiRank. The fraction f{Ks) for overlap with Shang- 
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hai university ranking (TS] is shown in FigjSjD. The Wikipedia 
PageRank reproduces in average around 70% of ranking 
[T5] that is about 10% higher than gives the advanced 
Google search. CheiRank and 2DRank at maximum give 
around 25% and 35%. These ranks highlight connectiv- 
ity degree of universities that leads to appearance of sig- 
nificant number of arts, religious and military special- 
ized colleges (12% and 13% respectively for CheiRank and 
2DRank) while PageRank has only 1% of them. CheiRank 
and 2DRank introduce also a larger number of relatively 
small universities. We argue that such colleges and univer- 
sities keeps links to their alumni in a significantly better 
way that increases their ranks. 

For personalities the Wikipedia ranking gives 1. Napoleon 
I of France, 2. George W. Bush, 3. Elizabeth II of the 
United Kingdomiov PageRank; I.Michael Jackson, 2. Frank 
Lloyd Wright, 3. David Bowie for 2DRank; 1. Kasey S. 
Pipes, 2. Roger Calmel, 3. Yury G. Chernavsky for CheiRank. 
The overlap fraction f{Ks) for Hart's personality ranking 
[12] is shown in Fig. [5]:. Even for the PageRank it is at 
maximum 35% being around 10% for 2DRank and almost 
zero for CheiRank. We attribute this to a very broad dis- 
tribution of personalities in 2D plane (Fig.[3J:) and a large 
variety of human activities which we classify by 5 main 
categories: politics, religion, arts, science, sport. For top 
100 PageRank personalities we have for these categories: 
58, 10, 17, 15, respectively. Clearly PageRank overesti- 
mates the significance of politicians. For 2DRank we find 
respectively 24, 5, 62, 7, 2. Thus this rank highlights artis- 
tic sides of human activity. For CheiRank we have 15, 
1, 52, 16, 16 so that the dominant contribution comes 
from arts, science and sport. The interesting property of 
this rank is that it selects many composers, singers, writ- 
ers, actors. As an interesting feature of CheiRank we note 
that among scientists it selects those who are not so much 
known to a broad public but who discovered new objects, 
e.g. George Lyell who discovered many Australian butter- 
flies or Nikolai Chernykh who discovered many asteroids. 
CheiRank also selects persons active in several categories 
of human activity. 

Of course, the overlap fraction / gives only the integral 
overlap between Wikipedia ranking and the selected inter- 
nationally recognized ranking. It is possible to get more 
reflned comparison taking the window fraction f^ of rel- 
ative number of names being the same inside a certain 
window of fixed size. We present such type of data for f^ 
in Fig. [S] for a window size of 20 nodes. For countries and 
universities the values of f^ are rather high for top 40 
ranks but for higher ranks there is a gradual decrease of 
fw We attribute this to increasing fluctuations at large 
values of rank where each item can move in a signiflcant 
interval. For personalities the window fraction f^ remains 
rather low. We attribute this to a very broad variety of 
human activity as discussed above. 

When a human activity is fixed in a more precise way 
then the Wikipedia ranking gives a rather reliable order- 
ing. For example, for 754 physicists (see Fig. [3ji) we find 
at the top: 1. Aristotle, 2. Albert Einstein, 3. Isaac New- 
ton from PageRank; 1. Albert Einstein, 2. Nikola Tesla, 
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Fig. 10. (Color online) (a) Overlap fraction / of Nobel laureate 
physicists as a function of rank of physicists via Wikipedia 
articles; (b) window overlap fraction for the data of panel 
(a), points are placed at the middle of window interval, window 
size is 20 nodes. Colors mark Wikipedia PageRank K (blue), 
2DRank K2 (green) and CheiRank K* (red) which ranks are 
plotted on horizontal axis. 



3. Benjamin Franklin from 2DRank; 1. Hubert Reeves, 2. 
Shen Kuo, 3. Stephen Hawking from CheiRank. It is clear 
that PageRank gives most known, 2DRank gives most 
known and active in other areas, CheiRank gives those 
who are known and contribute to popularization of sci- 
ence. Indeed, e.g. Hubert Reeves and Stephen Hawking are 
very well known for their popularization of physics that 
increases their communicative power and place them at 
the top of CheiRank. Shen Kuo obtained recognized re- 
sults in an enormous variety of fields of science that leads 
to the second top position in CheiRank even if his activity 
was about thousand years ago. 

For physicists, who lived at the time of Nobel prize 
and could get it, we can determine the overlap fraction / 
as a relative number of Nobel laureates at a given rank 
value K,K2,K* (see curves in Fig. [TOk). In certain as- 
pects, this definition of / for physicists is different from 
the case of categories in Fig. |5] since here Nobel laureates 
are in the same listing of Wikipedia ranks of physicists 
while in Fig.[5]the rank Ks is taken from independent clas- 
sification source. The data for Nobel laureates in Fig. [TOk 
give high average values of / w 0.5 for PageRank and 
2DRank, and / w 0.25 for CheiRank. We note that the 
number of Nobel prizes is not so large and even very no- 
table physicists remained without it (e.g. Thomas Edison, 
Nikola Tesla, Alexander Graham Bell are those from the 
top of PageRank who remained without the prize). Hence, 
the prediction level of / « 0.5 can be considered as rather 
high. 

The window overlap fraction f^ for Nobel laureates is 
shown in Fig. llOb . here inside the window gives the 
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relative fraction of Nobel laureates without coincidence 
of concrete names (Nobel laureates form a sublisting of 
listing of physicists; note that in Fig. IHlwe look overlap of 
same article titles from a pair of different listings). Even 
at large rank values remains on the level of 40-50% 
that is twice higher than the average fraction of Nobel 
laureates in the list of physicists < / >= 193/754 w 0.25. 
This shows that Wikipedia ranking have high correlations 
with the list selected by the Nobel committee. 

The ranking of 193 Nobel laureate physicists give 1. Al- 
bert Einstein, 2. Enrico Fermi, 3. Richard Feynman from 
PageRank; 1. Albert Einstein, 2. Richard Feynman, 3. Wer- 
ner Heisenberg from 2DRank and CheiRank. The commu- 
nication power is well visible if to consider certain specific 
names, e.g. Abdus Salam has PageRank 20 and CheiRank 
5. It is clear that the communicative power of this article 
is strongly enhanced due to existence of the Abdus Salam 
International Centre for Theoretical Physics. 

Thus Wikipedia ranking can be applied to various cat- 
egories giving rather good results without any fitting. For 
example, for Dow- Jones companies we have 1. Microsoft, 

2. IBM, 3. The Walt Disney Company from PageRank and 
1. Cisco Systems, 2. The Walt Disney Company, 3. Mi- 
crosoft from CheiRank. CheiRank correctly selects Cisco 
Systems, the worldwide leader in networking, at the top 
of communication power of Dow-Jones listing. 

For chess players we find: 1. Garry Kasparov, 2. Bobby 
Fischer, 3. Alexander Alekhine bom PageRank 1. Bobby 
Fischer, 2. Alexander Alekhine, 3. Emanuel Lasker from 
2DRank K2 and 1. Bobby Fischer, 2. Alexander Alekhine, 

3. Wilhelm Steinitz from CheiRank K* . Thus world chess 
champions, including most strong, appear at the top po- 
sitions. Indeed, according to FIDE 27 Garry Kasparov is 
ranked number 1 most times. Clearly a lot of hot stories 
around Bobby Fischer increased his 2DRank and CheiRank. 



4 Discussion 

On the basis of presented results we conclude that the 
ranking of Wikipedia articles allows to rank human knowl- 
edge in a rather reliable way. The 2D ranking highlights 
the properties of articles in a new rich and fruitful manner. 
We think that such type of 2D ranking will find further 
useful applications for various complex networks including 
the WWW. 

It would be also interesting to apply the 2D ranking 
to citation network of Physical Review where PageRank 
is known to give a reliable ranking of physicists |28i[29] . 
We suppose that for this network CheiRank will allow 
to select notable reviews which played an important role 
in the propagation of knowledge between various fields of 
physics. It is possible that combination of 2D ranking with 
other methods of various entities ranking of Wikipedia can 
be also useful (see e.g. [50]). 

Wikipedia becomes the largest library of human knowl- 
edge. "The Library exists ab aeterno" declared Jorge Luis 
Borges [21] • Thus, the ranking of this enormous amount of 
knowledge becomes a formidable challenge and we think 



that the 2D ranking will play for this task a useful and 
important role. 



5 Appendix 

Below we give the listings of Wikipedia articles in their 
rank order according to the three ranks discussed in the 
paper for the categories discussed. More detailed data are 
available at |24) . 

LI) Top 20 Wikipedia articles: 

PageRank: 1. United States, 2. United Kingdom, 3. 
France, 4. Germany, 5. England, 6. Canada, 7. World War 

11, 8. Australia, 9. India, 10. Japan, 11. English Language, 
12.1taly, 13. Association footbah, 14. 2007, 15. London, 16. 
Poland, 17. Animal, 18. List of sovereign states, 19. 2008, 
20. New York City. 

2DRank: 1. India, 2. Singapore, 3. Pakistan, 4. Brazil, 
5. Columbia University, 6. Thailand, 7. Amphibian, 8. 
Chile, 9. Fossil, 10. Malaysia, 11. Yorkshire, 12. New York 
Yankees, 13. Virginia, 14. Iceland, 15. California, 16. Eng- 
land, 17. New Jersey, 18. Michigan, 19. Hong Kong, 20. 
Municipalities of Switzerland. 

CheiRank: 1. Portal: Contents/Outline of knowledge/- 
Gcography and places, 2. List of state leaders by year, 
3. Portal: Contents/Index/Geography and places, 4. Lists 
of country-related topics, 5. PortahMathematics/Index, 6. 
List of United Kingdom locations, 7. Navy Office of Com- 
munity Outreach, 8. PortahSpaceflight/Status, 9. List of 
Tachinidae genera and species, 10. Village Development 
Committee, 11. Tachinidae, 12. Navy Weeks, 13. Lists of 
birds by region, 14. Outline of Africa, 15. List of Bulbo- 
phyllum species, 16. Bulbophyllum, 17. Index of Thailand- 
related articles, 18. PortahMiddle-earth/Pages, 19. Por- 
tahTrains/ Anniversaries, 20. PortahContents/Outline of 
knowledge/History and events. 

L2) Top 20 articles in category countries: 

PageRank: 1. United States, 2. United Kingdom, 3. 
France, 4. Germany, 5. Canada, 6. Australia, 7. India, 
8. Japan, 9. Italy, 10. Poland, 11. Spain, 12. Russia, 13. 
Netherlands, 14. Brazil, 15. New Zealand, 16. Sweden, 17. 
Romania, 18. Switzerland, 19. Mexico, 20. Norway. 

2DRank and CheiRank give the same order: 1. India, 2. 
Singapore, 3. Pakistan, 4. Brazil, 5. Thailand, 6. Chile, 7. 
Malaysia, 8. Iceland, 9. Hong Kong. 10. Canada, 11. Peru, 

12. Afghanistan, 13. Egypt, 14. Armenia, 15. Argentina, 
16. United Kingdom, 17. Colombia, 18. Finland, 19. Benin, 
20. Bolivia. 

United States are at rank 41. Top 100 countries have 
the same order in 2DRank and CheiRank. We attribute 
this to a rather sparse distribution in CheiRank index 
between countries. SJR country documentation rank for 
1996-2008 is available at [17]. 

L3) Top 20 articles in category universities: 
PageRank: 1. Harvard University, 2. University of Ox- 
ford, 3. University of Cambridge, 4. Columbia University, 
5. Yale University, 6. Massachusetts Institute of Technol- 
ogy, 7. Stanford University, 8. University of California, 
Berkeley, 9. Princeton University, 10. Cornell University, 
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11. University of Chicago, 12. University of Michigan, 13. 
University of California, Los Angeles, 14. University of 
Pennsylvania, 15. New York University, 16. University of 
Texas at Austin, 17. University of Toronto, 18. Univer- 
sity of Southern California, 19. University of Virginia, 20. 
University of Florida. 

2DRank: 1. Columbia University, 2. University of Florida, 
3. Florida State University, 4. University of California, 
Berkeley, 5. Northwestern University, 6. Brown Univer- 
sity, 7. University of Southern California, 8. Carnegie Mel- 
lon University, 9. Massachusetts Institute of Technology, 
10. University of Michigan, 11. Georgetown University, 12. 
Juilliard School, 13. University of Pittsburgh, 14. Amherst 
College, 15. Cornell University, 16. Durham University, 17. 
Rutgers University, 18. Monash University, 19. The Uni- 
versity of Western Ontario, 20. University of Toronto. 

CheiRank: 1. Columbia University, 2. University of 
Florida, 3. Florida State University, 4. Brooklyn College, 
5. Amherst College, 6. The University of Western On- 
tario, 7. University of Sheffield, 8. University of Califor- 
nia, Berkeley, 9. Northwestern University, 10. Northeast- 
ern University, 11. Brown University, 12. Queen's Univer- 
sity of Belfast (Northern Ireland Parliament constituency) , 
13. Fairfield University, 14. University of Southern Califor- 
nia, 15. Carnegie Mellon University, 16. Grambling State 
University, 17. Massachusetts Institute of Technology, 18. 
University of Michigan, 19. Georgetown University, 20. 
University of Pittsburgh. 

Rank given by advanced Google search in Wikipedia 
English domain at May 19, 2010: 1. Princeton Univer- 
sity, 2. Columbia University, 3. University of Oxford, 4. 
Stanford University, 5. Harvard University, 6. University 
of Cambridge, 7. Cornell University, 8. University of Cali- 
fornia, Berkeley, 9. Yale University, 10. University of Vir- 



6. Landon Donovan, 7. Marilyn C. Solvay, 8. Matt Kel- 
ley, 9. Johann Georg Hagen, 10. Chikage Oogi, 11. Bobbie 
Vaile, 12. Rosie Malek-Yonan, 13. Blythe McGarvie, 14. 
Djoko Hardono, 15. Cristina Bella, 16. Sid Deuce, 17. Joey 
Hamilton, 18. Kiki Dee, 19. Carlos Francis, 20. Percy Jew- 
ett Burrell. 

Hart's personality rank is available at [19 and at 
http: / / w ww.adhcrcnts.com/adh_influ.html . 
L5) Top 20 articles in category physicists: 
PageRank: 1. Aristotle, 2. Albert Einstein, 3. Isaac 
Newton, 4. Thomas Edison, 5. Benjamin Franklin, 6. Got- 
tfried Leibniz, 7. Avicenna, 8. Carl Friedrich Gauss, 9. 
Galileo Galilei, 10. Nikola Tesla, 11. Andre-Marie Ampere, 
12. Michael Faraday, 13. Leonhard Euler, 14. Alexander 
Graham Bell, 15. James Clerk Maxwell, 16. Archimedes, 
17. Blaise Pascal, 18. Stephen Hawking, 19. Enrico Fermi, 
20. Johannes Kepler. 

2DRank: 1. Albert Einstein, 2. Nikola Tesla, 3. Ben- 
jamin Franklin, 4. Avicenna, 5. Isaac Newton, 6. Thomas 
Edison, 7. Stephen Hawking, 8. Gottfried Leibniz, 9. Richard 
Feynman, 10. Aristotle, 11. Alhazen (Ibn al-Haytham) 
Iraq, 12. Werner Heisenberg, 13. Heinrich Hertz, 14. Jo- 
hannes Kepler, 15. Galileo Galilei, 16. Shen Kuo, 17. Abu 
Rayhan Biruni- Persian, 18. Alexander Graham Bell, 19. 
Robert Hooke, 20. Michael Faraday. 

CheiRank: 1. Hubert Reeves, 2. Shen Kuo, 3. Stephen 
Hawking, 4. Nikola Tesla, 5. Albert Einstein, 6. Arthur 
Stanley Eddington, 7. Richard Feynman, 8. John Joseph 
Montgomery, 9. Josiah Willard Gibbs, 10. Heinrich Hertz, 
11. Benjamin Franklin, 12. Edwin Hall, 13. Avicenna, 14. 
Isaac Newton, 15. Thomas Edison, 16. Michio Kaku, 17. 
Abu Rayhan Biruni- Persian, 18. Abdul Qadeer Khan, 19. 
Werner Heisenberg, 20. Gottfried Leibniz. 

L6) Top 20 articles in category Nobel laureate physi- 



ginia, 11. Northwestern University, 12. University of Chicago,cists (physicists who got any Nobel prize 



13. Brown University, 14. University of Michigan, 15. Rut- 
gers University, 16. University of Washington, 17. Indiana 
University, 18. University of Minnesota, 19. Howard Uni- 
versity, 20. Leiden University. 

Shanghai university rank of 2009 is available at [ISj. 
L4) Top 20 articles in category personalities: 
PageRank: 1. Napoleon I of France, 2. George W. Bush, 
3. Elizabeth II of the United Kingdom, 4. William Shake- 
speare, 5. Carl Linnaeus, 6. Adolf Hitler, 7. Aristotle, 8. 
Bill Clinton, 9. Franklin D. Roosevelt, 10. Ronald Reagan, 
11. Barack Obama, 12. Richard Nixon, 13. George Wash- 
ington, 14. Joseph Stalin, 15. Abraham Lincoln, 16. John 

F. Kennedy, 17. Muhammad, 18. Winston Churchill, 19. 
Henry VIII of England, 20. Alexander the Great. 

2DRank: 1. Michael Jackson, 2. Frank Lloyd Wright, 
3. David Bowie, 4. Hillary Rodham Clinton, 5. Charles 
Darwin, 6. Stephen King, 7. Richard Nixon, 8. Isaac Asi- 
mov, 9. Frank Sinatra, 10. Elvis Presley, 11. Edward El- 
gar, 12. Stephen Sondheim, 13. Agatha Christie, 14. Pope 
John Paul II, 15. Robert A. Heinlein, 16. Adolf Hitler, 
17. Madonna (entertainer), 18. Ozzy Osbourne, 19. John 
McCain, 20. Jesus. 

CheiRank: 1. Kasey S. Pipes, 2. Roger Calmel, 3. Yury 

G. Chernavsky, 4. Josh Billings (pitcher), 5. George Lyell, 



PageRank: 1. Albert Einstein, 2. Enrico Fermi, 3. Richard 
Feynman, 4. Max Planck, 5. Guglielmo Marconi, 6. Werner 
Heisenberg, 7. Marie Curie, 8. Niels Bohr, 9. Paul Dirac, 
10. J.J.Thomson, 11. Max Born, 12. John Strutt, 3rd Baron 
Rayleigh, 13. Andrei Sakharov, 14. Pierre Curie, 15. Sub- 
rahmanyan Chandrasekhar, 16. Wolfgang Pauli, 17. Lev 
Landau, 18. Eugene Wigner, 19. Albert Abraham Michel- 
son, 20. Abdus Salam. 

2DRank: 1. Albert Einstein, 2. Richard Feynman, 3. 
Werner Heisenberg, 4. Enrico Fermi, 5. Max Born, 6. Marie 
Curie, 7. Wolfgang Pauli, 8. Max Planck, 9. Eugene Wigner, 
10. Paul Dirac, 11. Guglielmo Marconi, 12. Abdus Salam, 
13. Hans Bethe, 14. Andrei Sakharov, 15. Steven Chu, 16. 
Niels Bohr, 17. J.J.Thomson, 18. Steven Weinberg, 19. 
Peter Debye, 20. Subrahmanyan Chandrasekhar. 

CheiRank: 1. Albert Einstein, 2. Richard Feynman, 3. 
Werner Heisenberg, 4. Brian David Josephson, 5. Abdus 
Salam, 6. C.V.Raman, 7. Peter Debye, 8. Enrico Fermi, 9. 
Wolfgang Pauli, 10. Steven Weinberg, 11. Max Born, 12. 
Eugene Wigner, 13. Marie Curie, 14. Luis Walter Alvarez, 
15. Percy Williams Bridgman, 16. Roy J. Glauber, 17. 
Max Planck, 18. Paul Dirac, 19. Guglielmo Marconi, 20. 
Hans Bethe. 

L7) Top 30 articles in category chess players: 
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PageRank: 1. Garry Kasparov, 2. Bobby Fischer, 3. 
Alexander Alekhine, 4. Anatoly Karpov, 5. Emanuel Lasker, 
6. Mikhail Botvinnik, 7. Vladimir Kramnik, 8. Viswanathan 
Anand, 9. Paul Keres, 10. Boris Spassky, 11. Veselin Topalov 
12. Wilhelm Steinitz, 13. Tigran Petrosian, 14. Max Euwe, 
15. David Bronstein, 16. Mikhail Tal, 17. Viktor Korch- 
noi, 18. Vasily Smyslov, 19. Samuel Reshevsky, 20. Bent 
Larsen, 21. Jose Raul Capablanca, 22. Boris Gelfand, 23. 
Gata Kamsky, 24. Alexei Shirov, 25. Mark Taimanov, 26. 
Magnus Carlsen, 27. Efim Geller, 28. Ruslan Ponomariov, 
29. Rustam Kasimdzhanov, 30. Alexander Khalifman. 

2DRank: 1. Bobby Fischer, 2. Alexander Alekhine, 3. 
Emanuel Lasker, 4. Garry Kasparov, 5. Wilhelm Steinitz, 
6. Paul Keres, 7. Mikhail Botvinnik, 8. Jose Raul Capa- 
blanca, 9. Bent Larsen, 10. Boris Spassky, 11. Viswanathan 
Anand, 12. Magnus Carlsen, 13. Vladimir Kramnik, 14. 
Efim Geller, 15. Samuel Reshevsky, 16. Anatoly Karpov, 
17. Mikhail Tal, 18. Viktor Korchnoi, 19. Max Euwe, 20. 
Veselin Topalov, 21. Gata Kamsky, 22. Tigran Petrosian, 

23. David Bronstein, 24. Rustam Kasimdzhanov, 25. Rus- 
lan Ponomariov, 26. Vasily Smyslov, 27. Alexei Shirov, 28. 
Boris Gelfand, 29. Wolfgang Unzicker, 30. Mark Taimanov. 

CheiRank: 1. Bobby Fischer, 2. Alexander Alekhine, 
3. Wilhelm Steinitz, 4. Emanuel Lasker, 5. Garry Kas- 
parov, 6. Paul Keres, 7. Mikhail Botvinnik, 8. Jose Raul 
Capablanca, 9. Magnus Carlsen, 10. Bent Larsen, 11. Boris 
Spassky, 12. Viswanathan Anand, 13. Vladimir Kramnik, 

14. Efim Geller, 15. Samuel Reshevsky, 16. Anatoly Kar- 
pov, 17. Mikhail Tal, 18. Viktor Korchnoi, 19. Max Euwe, 
20. Veselin Topalov, 21. Gata Kamsky, 22. Tigran Pet- 
rosian, 23. David Bronstein, 24. Rustam Kasimdzhanov, 
25. Ruslan Ponomariov, 26. Vasily Smyslov, 27. Alexei 
Shirov, 28. Boris Gelfand, 29. Wolfgang Unzicker, 30. Mark 
Taimanov. 

L8) Ranking of 30 Dow- Jones companies: 

PageRank: 1. Microsoft, 2. IBM, 3. The Walt Dis- 
ney Company, 4. Intel Corporation, 5. Hewlett-Packard, 
6. General Electric, 7. Mcdonald's, 8. Boeing, 9. At&t, 10. 
Cisco Systems, 11. DuPont, 12. ExxonMobil, 13. Procter 
& Gamble, 14. Bank of America, 15. Verizon Communi- 
cations, 16. JPMorgan Chase, 17. American Express, 18. 
Pfizer, 19. The Coca-Cola Company, 20. American Ex- 
press, 21. Chevron Corporation, 22. 3M, 23. Merck & Co., 

24. The Home Depot, 25. Alcoa, 26. Johnson & Johnson, 
27. Kraft Foods, 28. Caterpillar Inc., 29. United Technolo- 
gies Corporation, 30. The Travelers Companies. 

2DRank and CheiRank have the same order: 1. Cisco 
Systems, 2. The Walt Disney Company, 3. Microsoft, 4. 
Kraft Foods, 5. IBM, 6. At&t, 7. Hewlett-Packard, 8. Pfizer, 
9. Intel Corporation, 10. ExxonMobil, 11. Caterpillar Inc., 
12. DuPont, 13. General Electric, 14. American Express, 

15. Johnson & Johnson, 16. Boeing, 17. Wal-Mart, 18. 
Bank of America, 19. Verizon Communications, 20. JP- 
Morgan Chase, 21. Merck k Co., 22. The Coca-Cola Com- 
pany, 23. 3M, 24. Procter & Gamble, 25. The Home De- 
pot, 26. Mcdonald's, 27. Alcoa, 28. Chevron Corporation, 
29. United Technologies Corporation, 30. The Travelers 
Companies. 
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